CDS

Accession Number TCMCG021C41677
gbkey CDS
Protein Id XP_029122717.1
Location join(15730468..15730570,15737252..15737529,15737615..15737990,15742068..15742151,15743849..15744135,15744291..15744347,15748204..15748533,15748615..15748793,15748866..15749148,15751818..15752060,15752149..15752313,15755671..15755811,15755956..15756174,15756411..15756698,15757732..15757938)
Gene LOC105052623
GeneID 105052623
Organism Elaeis guineensis
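
As a consistency check on the Location field above, the exon segments in the join(...) expression can be summed and compared with the reported protein length. This is a minimal sketch, assuming the coordinates are 1-based and inclusive (the usual GenBank feature convention) and that the feature lies on the forward strand, since no complement(...) wrapper appears in the record. The helper names `parse_join` and `cds_length` are hypothetical and introduced only for illustration.

```python
import re

# Location string copied from the record above.
LOCATION = (
    "join(15730468..15730570,15737252..15737529,15737615..15737990,"
    "15742068..15742151,15743849..15744135,15744291..15744347,"
    "15748204..15748533,15748615..15748793,15748866..15749148,"
    "15751818..15752060,15752149..15752313,15755671..15755811,"
    "15755956..15756174,15756411..15756698,15757732..15757938)"
)


def parse_join(location: str) -> list[tuple[int, int]]:
    """Extract (start, end) pairs from a simple join(a..b,c..d,...) string.

    Assumption: no partial markers (< or >) and no complement() wrapper.
    """
    return [(int(a), int(b)) for a, b in re.findall(r"(\d+)\.\.(\d+)", location)]


def cds_length(segments: list[tuple[int, int]]) -> int:
    """Total nucleotide length, treating both coordinates as inclusive."""
    return sum(end - start + 1 for start, end in segments)


segments = parse_join(LOCATION)
total = cds_length(segments)

print(len(segments))   # 15 segments
print(total)           # 3240 nucleotides
print(total // 3 - 1)  # 1079 codons, excluding the stop codon
```

The total of 3240 nt corresponds to 1080 codons, that is 1079 amino acids plus one stop codon, which agrees with the protein length listed in the record.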

Protein

Length 1079 aa
Molecule type protein
Topology linear
Data_file_division PLN
dblink BioProject:PRJNA268357
db_source XM_029266884.1
Definition protein ALWAYS EARLY 3 isoform X4 [Elaeis guineensis]

EGGNOG-MAPPER Annotation

COG_category BDT
Description Protein ALWAYS EARLY
KEGG_TC -
KEGG_Module -
KEGG_Reaction -
KEGG_rclass -
BRITE ko00000, ko00001
KEGG_ko ko:K21773
EC -
KEGG_Pathway ko04218, map04218
GOs -

Sequence

CDS:  
ATGTCGAAAAGTTCCGATGACCAGTATCCTGATCTATTGCAGTATCAGTCAGGTCCAACAGCCTCAGGATGCCTGTCATTATTAAAAAAGAAGCGATCTGGAGATCTATTTCCAGGTAGCAGACCTCGGGCTGTTGGAAAAAGGACACCCCGTGTCCCTGTTTCAAATATGTATGGCAGAGATGATAGGGATAAAATACTTTCCCCAAATAAGCAAGCATTGAAATCTGTTTCAAACACTGCTGATGATGAAGGTGCCCATGTGGCAGCATTGGCTTTAGCAGAGGTTTCTCAGAGAGGAGGCTCACCACAGCTTTCTAGAACACCTGGAAGAAGAGCTGATCACATGAGATCTTCTCCTGCTAAGAGTGGTGAGAAAAAGAACGCTGAGTCAGAGATGGACAGTTCAAAGTTAGTTGGCGCTCAAATGGAGGGTGACTGTCATGAAGGTAGTTTAGGAAGTAGAGAAGCTGAGAATGGAGATTTTGCTAGAGATGCTACTCATCTGATAGAAAATGAAGGTGCTGCAGCAGTTGAAACTCGAAGGAAGGTGAAGAAACTTCAGGGAAAGAGAAAAAAAGTACCAGCAGACATGGAAAATGATCAACTTGATGATGACAGGGAAGCATGCAGTGGTACTGAAGAAGGCATCAATATTAGAAAGATTAAAGATGAAATTGACGGAGAGACTACGGATGGTAAAACTGCAAGAGGATCCAAAAGTTCAAGGAAAAGAAGCCGTCAGCTATTTTTTGGAGATGAAAGCTCTGCCCTTGATGCTCTACAGACACTTGCAGATTTGTCTGTAAATATCTTGCTTCCTACCTCTACTGTTGAATCTGAATCATCTTTCCAAGTTAAAGAAGAGAAAAGAAACATCGACACTGCTGAGGAGCCTAATATACCTGAATCAATGTCAACGACTCATGAGAGAGATCAGTCCAAAGTTTCAGTGAAAAAGGAGACAGGGTATTCTACAAGTGTTGGTACTGATGCTGTTACCAGGAAGAGTGCTAAGCGTGCAAAGTGTTTACGTCATGATGCTAATGTCATTTCTGAAGTGAAGCAGCAAACTTGTGCATGCACTAGTGAGACGCAGAAAAAAAAGCGGAAGTCTTTGACTGGAAAAGCTTCAAAAGGTGAATTTAATAGTGATGCTCAGAAATATGAACCACAAAAGATAGAGGTCTCAGCAGAAGAAGGGAAGAGATTGGTTGGTAAGACTAGACGTGTTAGTCACGTTAGTTCATCACCGAAGCAAGGAAAATTGGTTAAACTACAGGAGAACTCTTCTTCAAGTACTGATCTAGTTAGACCAATCACAGATTCAAATGAAACAATTGTACAGGCTTCTACCACTTGTCCTGGTAACTTGCTAACCAAAAGTAAAAACCGCCGCAAAATAGGTCTACAGAAAGCATGGGCATTGAAGGAATTTAAATCCAATGAGAGTGCTGTAGGCGATCGTCCTGATAAGTACTTACATCCTGTCAACAGGGGGGTGGTTGATCTCAAGGAAAAACTTTCTCACTGCTTGTCTTCTCGAATGTTGCGGAGATGGTGTATGTTTGAGTGGTTTTACAGTGCAATAGATTATCCTTGGTTTGCCAAAAGTGAGTTTGTAGAGTACCTAAATCATGTGAGATTGGGCCATGTGCCAAGGCTAACTCGTATTGAGTGGGGCGTGATACGAAGTTCTCTTGGAAAGCCACGTAGGTTGTCAAAACAGTTTTTGCAGGAAGAAAGAGAGAAGCTTGAGCAATATCGTGAATCAGTTAGGAAGCATTATGCTGAACTTCGAGCTGGTGTTAGAGAAGGACTCCCAACAGATCTGGCTCAGCCTTTATCAGTTGGGCAACGTGTTATTGCTTGTCATCCCAAAACAAGAGAAATTCATGATGGAAGCATTCTGACTGTTGACCGGAACCGGTGCAGGGTTCAATTTGATCGGCCTGAATTAGGGGTTGAGCTTGTGATGGACATCGACTGCATGCCACTGAA
CCCATTGGAAAATATTCCTGAAGCACTTAGAAGACAGAATATTGTTGCGAATAAATTTTGCACGAGCTTCGCAGATACAAAGCTAGAAGACGGATCTAAGGAGTGGAAAATTGGAGGCTCCATGAAGTTTGCTCCAGCTGAGAGCTTGGAGATCACAAATGGGTCTTCTAGTATTGCTTCTTCTAGTTATCCGATGCATACCTTAATGAAGCAGGCAAAGGGGGACACAATTGATGCCATTGTACAGGCTAAAGCTACTGTAAATGAAGTTGCTGTTGCTGCACAACAGGCAATGTACAGTCAACCTTGTACATTGTCACAAATACAAGAACGAGAAGCTGACATAAGAGTCCTTGCAGAGTTGTCACGTGCCCTTGATAAAAAGGAAGCTCTGCTCATGGAACTGAGACACATGAATGAAGAAGTTTCTGGAAAGCAAAGGGATGGTGATGCCATTAAAGATTTGGAGCATTTTAGAAAGCAATATGCTATGGTGCTTGTGCAGCTAAGAGATGCCAACGATCAGGTTGCTTCGGCCTTGCTCTCTTTGAGGCAACGCAACACGTACCATGGGAATTCAACACATGCATGGGTTAGACCCATTGAGAATTCGGGGGGGCCTGCTGGACCTGCAGACTCTTGCAATTCATCAGCTTTTCTCAATCAGGATTCAGGATCTCATGTAACTGAGATTGTTGAAAGTTCAAGGCGGAAAGCAAGAACGGTAGTTGATGCTGCTGTGCAGGCTATGTGTGCTTTGAAAGAAGGAGAAGATGCTTTTGTCAAGATTGGAGAGGCTTTAGATTCTGTAAACAGCCGCATTTCTGGACCTGGTTCTGGCGTACTTGGAGTAAGACGTAATCCTCCTGATCCTGGACATGGCGGTTCAGCATATCAAGATCATACAACATCATGCATGCCTGAGGCAACAGCAAGTCATGCTAGTCCAAAACCCCATCTTTCTTCTGATTCAGAGATCCAACTTCCATCAGATCTTATTTCATCATGTGTTGCTACATTGCTCATGATACAGACCTGCACTGAGAGACAATGCCCACCTGCCGAGATTGCGCAGATTCTTGATTCTGCAGTCGCAAGTCTGCAGCCATGTTGTCCGCAGAACCTTCCAATTTACAGGGAGATAGAGACATTTATGGGCATCATTAAGAACCAAATGTTGGCACTGATACCCACTCCAAGCATCATACCACCTGTAGAGGTTCCCATTGTGCAAAAATGA
Protein:  
MSKSSDDQYPDLLQYQSGPTASGCLSLLKKKRSGDLFPGSRPRAVGKRTPRVPVSNMYGRDDRDKILSPNKQALKSVSNTADDEGAHVAALALAEVSQRGGSPQLSRTPGRRADHMRSSPAKSGEKKNAESEMDSSKLVGAQMEGDCHEGSLGSREAENGDFARDATHLIENEGAAAVETRRKVKKLQGKRKKVPADMENDQLDDDREACSGTEEGINIRKIKDEIDGETTDGKTARGSKSSRKRSRQLFFGDESSALDALQTLADLSVNILLPTSTVESESSFQVKEEKRNIDTAEEPNIPESMSTTHERDQSKVSVKKETGYSTSVGTDAVTRKSAKRAKCLRHDANVISEVKQQTCACTSETQKKKRKSLTGKASKGEFNSDAQKYEPQKIEVSAEEGKRLVGKTRRVSHVSSSPKQGKLVKLQENSSSSTDLVRPITDSNETIVQASTTCPGNLLTKSKNRRKIGLQKAWALKEFKSNESAVGDRPDKYLHPVNRGVVDLKEKLSHCLSSRMLRRWCMFEWFYSAIDYPWFAKSEFVEYLNHVRLGHVPRLTRIEWGVIRSSLGKPRRLSKQFLQEEREKLEQYRESVRKHYAELRAGVREGLPTDLAQPLSVGQRVIACHPKTREIHDGSILTVDRNRCRVQFDRPELGVELVMDIDCMPLNPLENIPEALRRQNIVANKFCTSFADTKLEDGSKEWKIGGSMKFAPAESLEITNGSSSIASSSYPMHTLMKQAKGDTIDAIVQAKATVNEVAVAAQQAMYSQPCTLSQIQEREADIRVLAELSRALDKKEALLMELRHMNEEVSGKQRDGDAIKDLEHFRKQYAMVLVQLRDANDQVASALLSLRQRNTYHGNSTHAWVRPIENSGGPAGPADSCNSSAFLNQDSGSHVTEIVESSRRKARTVVDAAVQAMCALKEGEDAFVKIGEALDSVNSRISGPGSGVLGVRRNPPDPGHGGSAYQDHTTSCMPEATASHASPKPHLSSDSEIQLPSDLISSCVATLLMIQTCTERQCPPAEIAQILDSAVASLQPCCPQNLPIYREIETFMGIIKNQMLALIPTPSIIPPVEVPIVQK
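
The relationship between the CDS and protein sequences above can be checked by translating the nucleotide sequence. The following is a minimal sketch, assuming the standard genetic code (NCBI translation table 1), which the record does not state explicitly. The function name `translate` and the table construction are illustrative and not part of the record. Whitespace and line breaks in the input are ignored, and translation stops at the first stop codon.

```python
from itertools import product

# Standard genetic code (assumed), laid out in the conventional TCAG order:
# the first base varies slowest and the third base varies fastest.
BASES = "TCAG"
AMINO = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa for codon, aa in zip(product(BASES, repeat=3), AMINO)
}


def translate(cds: str) -> str:
    """Translate a CDS into a protein string, stopping at the first stop codon.

    Codons containing characters outside A/C/G/T are reported as 'X'.
    """
    seq = "".join(cds.split()).upper()  # drop spaces and line breaks
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE.get(seq[i:i + 3], "X")
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First eight codons of the CDS in this record.
print(translate("ATGTCGAAAAGTTCCGATGACCAG"))  # MSKSSDDQ
# Last four codons of the CDS in this record, ending in the TGA stop.
print(translate("GTGCAAAAATGA"))  # VQK
```

Both fragments match the corresponding ends of the protein sequence listed above (MSKSSDDQ at the N-terminus, VQK at the C-terminus).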